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Abstract 

In this paper, we introduce a novel rule for synthesis of reactive systems, applicable to systems made 
of n components which have each their own objectives. It is based on the notion of admissible strategies. 
We compare our novel rule with previous rules defined in the literature, and we show that contrary to 
the previous proposals, our rule dehnes sets of solutions which are rectangular. This property leads to 
solutions which are robust and resilient. We provide algorithms with optimal complexity and also an 
abstraction framework. 


1 Introduction 

The automatic synthesis of reactive systems has recently attracted a considerable attention. The theoretical 
foundations of most of the contributions in this area rely on two-player zero sum games played on graphs: one 
player (player 1) models the system to synthesize, and the other player (player 2) models its environment. 
The game is zero-sum: the objective of player 1 is to enforce the specification of the system while the objective 
of player 2 is the negation of this specification. This is a worst-case assumption: because the cooperation of 
the environment cannot be assumed, we postulate that it is antagonistic. 

A fully adversarial environment is usually a bold abstraction of reality. Nevertheless, it is popular because 
it is simple and sound: a winning strategy against an antagonistic player is winning against any environment 
which pursues its own objective. But this approach may fail to find a winning strategy even if there exist 
solutions when the objective of the environment is taken into account. Also, this model is for two players 
only: system vs environment. In practice, both the system and the environment may be composed of several 
parts to be constructed individually or whose objectives should be considered one at a time. It is thus crucial 
to take into account different players’ objectives when synthesizing strategies; accordingly, alternative notions 
have been proposed in the literature. 

A first classical alternative is to weaken the winning condition of player 1 using the objective of the 
environment, requiring the system to win only when the environment meets its objective. This approach 
together with its weaknesses have been discussed in [3], we will add to that later in the paper. A second 
alternative is to use concepts from n-players non-zero sum games. This is the approach taken both by 
assume-guarantee synthesis 0 (AG), and by rational synthesis [IB] (RS). AG relies on secure equilibria [5] 
(SE), a refinement of Nash equilibria [5^ (NE). In SE, objectives are lexicographic: players first try to 
maximize their own specifications, and then try to falsify the specifications of others. It is shown in [8] 
that SE are those NE which represent enforceable contracts between the two players. In RS, the system is 
assumed to be monolithic and the environment is made of components that are partially controllable. In RS, 
we search for a profile of strategies where the system ensures its objective and the players that model the 
environment are given an “'acceptable” strategy profiles, from which it is assumed that they will not deviate. 
“Acceptable” is formalized either by NE, dominating strategies (Dorn), or subgame perfect equilibria (SPE). 
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Contributions. As a first and central contribution, we propose a novel notion of synthesis where we take 
into account different players’ objectives using the concept of admissible strategies [ms]. For a player with 
objective (j), a strategy a is dominated by cr' if cr' does as well as a w.r.t. (j) against all strategies of the other 
players, and better for some of those strategies. A strategy a is admissible if it is not dominated by another 
strategy. In [5] , the admissibility notion was lifted to games played on graphs, and algorithmic questions left 
open were solved in [3] , with the goal of model checking the set of runs that survive the iterative elimination 
of dominated strategies. Here, we use this notion to derive a meaningful notion to synthesize systems 
with several players, with the following idea. Rational players should only play admissible strategies since 
dominated strategies are clearly suboptimal. In assume-admissible synthesis (AA), we make the assumption 
that players play admissible strategies. Then for each player, we search for an admissible strategy that is 
winning against all admissible strategies of other players. AA is sound: any strategy profile that is winning 
against admissible strategies of other players, satisfies the objectives of all the players (Theorem [T]) . 

As a second contribution, we compare the different synthesis rules. First we apply all the rules on a 
simple but representative example, and show the main advantages of AA w.r.t. the other rules. Then we 
compare systematically the different approaches. We show when a solution for one rule implies a solution for 
another rule and we prove that, contrary to other rules, AA yields rectangular sets of solutions (Theorem[3]). 
We argue that the rectangularity property is essential for practical applications. 

As a third contribution, we provide algorithms to decide the existence of assume-admissible winning 
strategy profiles and prove the optimal complexity of our algorithm (Theorem 3]): PS PACE-complete for 
Muller, and PTIME for Biichi objectives. 

As a last important contribution, we provide an abstraction framework which allows us to define sufficient 
conditions to compute sets of winning assume-admissible strategies for each player in the game composition- 
ally (Theorem [5]) . 

Additional pointers to related works. We have already mentioned assume-guarantee synthesis [6] and 
rational synthesis [MIEI]. Those are the closest related works to ours as they pursue the same scientific 
objective: to synthesis strategy profiles for non-zero sum multi-player games by taking into account the 
specification of each player. As those works are defined for similar formal setting, we are able to provide 
formal statements in the core of the paper that add elements of comparison with our work. 

In [15], Faella studies several alternatives to the notion of winning strategy including the notion of 
admissible strategy. His work is for two-players only, and only the objective of one player is taken into 
account, the objective of the other player is left unspecified. Faella uses the notion of admissibility to define 
a notion of best-effort in synthesis while we use the notion of admissibility to take into account the objectives 
of the other players in an n player setting where each player has his own objective. 

The notion of admissible strategy is definable in strategy logics [9||24] and decision problems related 
to the AA rule can be reduced to satisfiability queries in such logics. Nevertheless this would not lead to 
worst-case optimal algorithms. Based on our previous work |4], we develop in this paper worst-case optimal 
algorithms. 

In [12| . Damm and Finkbeiner use the notion of dominant strategy to provide a compositional semi¬ 
algorithm for the (undecidable) distributed synthesis problem. So while we use the notion of admissible 
strategy, they use a notion of dominant strategy. The notion of dominant strategy is strictly stronger, every 
dominant strategy is admissible but an admissible strategy is not necessary dominant. Also, in multiplayer 
games with omega-regular objectives with complete information (as considered here), admissible strategies 
are always guaranteed to exist [5] while it is not the case for dominant strategies. We will show in an example 
that the notion of dominant strategy is too strong for our purpose. Also, note that the objective of Damm 
and Finkbeiner is different from ours: they use dominance as a mean to formalize a notion of best-effort for 
components of a distributed system w.r.t. their common objective, while we use admissibility to take into 
account the objectives of the other components when looking for a winning strategy for one component to 
enforce its own objective. Additionally, our formal setting is different from their setting in several respects. 
First, they consider zero-sum games between a distributed team of players (processes) against a unique 
environment, each player in the team has the same specification (the specification of the distributed system 
to synthesize) while the environment is considered as adversarial and so its specification is the negation of 
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the specification of the system. In our case, each player has his own objective and we do not distinguish 
between protagonist and antagonist players. Second, they consider distributed synthesis: each individual 
process has its own view of the system while we consider games with perfect information in which all players 
have a complete view of the system state. Finally, let us point out that Damm and Finkbeiner use the term 
admissible for specifications and not for strategies (as already said, they indeed consider dominant strategies 
and not admissible strategies). In our case, we use the notion of admissible strategy which is classical in 
game theory, see e.g. [mil]. This vocabulary mismatch is unfortunate but we decided to stick to the term 
of “admissible strategy” which is well accepted in the literature, and already used in several previous works 
on (multi-player) games played on graphs [H [TSl 0] . 

Structure of the paper. Sect. 2 contains definitions. In Sect. 3, we review synthesis rules introduced in 
the literature and define assume-admissible synthesis. In Sect. 4, we consider an example; this allows us to 
underline some weaknesses of the previous rules. Sect. 5 presents a formal comparison of the different rules. 
Sect. 6 contains algorithms for Biichi and Muller objectives, and Sect. 7 abstraction techniques applied to 
our rule. 


2 Definitions 

A turn-based multiplayer arena is a tuple A = {V, (S^jig-p, Sinit, (Actijig-p, S) where 7^ is a finite set of players; 
for i € V, Si is a finite set of player-z states; we let S = 1+Jjgp S^; Sinit G S is the initial state; for every i € V, 
Acti is the set of player-z actions; we let Act = Uigp Act^; and S: S x Act S is the transition function. A 
run p is a sequence of alternating states and actions p = siaiS 2 a 2 ... S (S • Act)"^ such that for all z > I, 
<J(si,ai) = Si+i. We write pi = Si, and acti(p) = a^. A history is a finite prefix of a run ending in a state. 
We denote by p<k the history sioi ... Sk', and write last(p<fe) = Sk, the last state of the history. The set of 
states occurring infinitely often in a run p is Inf(p) = {s G S | Vj G N. 3z > j, pi = s}. 

A strategy of player z is a function Oi : (S* ■ Si) —>■ Actp A strategy profile for the set of players P QV is 
a tuple of strategies, one for each player of P. We write —z for the set V \ {z}. Let Si(A) be the set of the 
strategies of player z in A, written if A is clear from context, and Sp the strategy profiles oi P QV. 

A run p is compatible with strategy cr for player z if for all j > Pj € Si implies act j{p) = a{p<j). It 
is compatible with strategy profile a-p if it is compatible with each ai for i G P. The outcome of a strategy 
profile CTp is the unique run compatible with crp starting at Sinit, denoted OutA(crp). We write OutA,s(CTp) 
for the outcome starting at state s. Given ap G 'Ep with P C 7^, let Out a (up) denote the set of runs 
compatible with ap, and extend it to OutA(S') where S' is a set of strategy profiles. For E C Si x Acti, let 
Strati (A) denote the set of player-z strategies tr whose compatible outcomes use action a from a state s only 
if (s, a) G E. 

An objective 0 is a subset of runs. A strategy Ui of player z is winning for objective fii if for all cr_i G S_i, 
OutA(cri, (T_i) G (pi- A game is an arena equipped with an objective for each player, written G = (A, {(j)i)i^p) 
where for each player z, pi is an objective. Given a strategy profile up for the set of players P, we write 
G,CTp \= (j) if OutA(CTp) C (j). We write OutG(crp) = OutA(crp), and Outg = OutG(S) for S C Si. For any 
coalition C CP, and objective (j), we denote by Winc(A, fi) the set of states s such that there exists uc G Sc 
with OutG,s(crc) C (j). 

Although we prove some of our results for general objectives, we give algorithms for w-regular objectives 
represented by Muller conditions. A Muller condition is given by a family P of sets of states: pi = {p \ 
Inf(/9) G P}. Following [1^, we assume that P is given by a Boolean circuit whose inputs are S, which 
evaluates to true exactly on valuations encoding subsets S G P. We also use linear temporal logic (LTL) [27] 
to describe objectives. LTL formulas are defined by (j) := Gfi \ Ffi \ Xfi \ (pUfi \ p\l<p \ S where S C S 
(We refer to [14] for the semantics.) We consider the special case of Biichi objectives, given by GF(i?) = 
{p \ B D Inf(p) 7 ^ 0}. Boolean combinations of formulas GF(S') define Muller conditions representable by 
polynomial-size circuits. 

In any game G, a player i strategy Ui is dominated by tr' if for all a^i G S_j, G,ai,a-i ^ pi implies 
G,a),a-i \= pi and there exists cr_i G S_j, such that G,cr',cr_i ^ pi and G,tTi,tT_i ^ pi, (this is classically 
called weak dominance, but we call it dominance for simplicity). A strategy which is not dominated is 


3 


admissible. Thus, admissible strategies are maximal, and incomparable, with respect to the dominance 
relation. We write Admi(G) for the set of admissible strategies in and Admp(G) = Oigp 
product of the sets of admissible strategies for P C P. 

Strategy ai is dominant (Dom) if for all a[ and (T_i, G,(T',(T_i \= (j)i implies G,cri,cr_i ^ (fti. The set of 
dominant strategies for player i is written Domi(G). A Nash equilibrium (NE) for G is a strategy profile a-p 
such that for all i S P, and cr' G E^, ^ (pi implies G,(Tp \= pi] thus no player can improve its 

outcome by deviating from the prescribed strategy. A Nash equilibrium for G from s, is a Nash equilibrium 
for G where the initial state is replaced by s. A subgame-perfect equilibrium (SPE) for G is a strategy 
profile (Tp such that for all histories h, {ui oh)i^p is a Nash equilibrium in G from state last(h), where given 
a strategy a, a oh denotes the strategy last(/i) ■ h' a{h ■ h'). 

3 Synthesis Rules 

In this section, we review synthesis rules proposed in the literature, and introduce a novel one: the assume- 
admissible synthesis rule (AA). Unless stated otherwise, we fix for this section a game G, with players 
V — {1,...,n} and their objectives pi,... ,pn- 

Rule Goop: The objectives are achieved cooperatively if there is a strategy profile up = (ci, tT 2 , • ■ • > such 
that G, (Tp ^ f\i(zp pi- 

This rule [231 [ID] asks for a strategy profile that jointly satisfies the objectives of all the players. This 
rule makes very strong assumptions: players fully cooperate and strictly follow their respective strategies. 
This concept is not robust against deviations and postulates that the behavior of every component in the 
system is controllable. This weakness is well-known: see e.g. [6] where the rule is called weak co-synthesis. 
Rule Win. The objectives are achieved adversarially if there is a strategy profile trp = {ai,..., an) such 
that for all i G V, G,ai \= pi. 

This rule does not require any cooperation among players: the rule asks to synthesize for each player i 
a strategy which enforces his/her objective pi against all possible strategies of the other players. Strategy 
profiles obtained by Win are extremely robust: each player is able to ensure his/her objective no matter how 
the other players behave. Unfortunately, this rule is often not applicable in practice: often, none of the 
players has a winning strategy against all possible strategies of the other players. The next rules soften this 
requirement by taking into account the objectives of other players. 

Rule Win-under-Hyp: Given a two-player game G with V = {1,2} in which player 1 has objective pi, 
player 2 has objective p 2 , player 1 can achieve adversarially pi under hypothesis p 2 , if there is a strategy ai 
for player 1 such that G, cti \= p 2 ^ pi- 

The rule winning under hypothesis applies for two-player games only. Here, we consider the synthesis 
of a strategy for player 1 against player 2 under the hypothesis that player 2 behaves according to his/her 
specification. This rule is a relaxation of the rule Win as player 1 is only expected to win when player 2 plays 
so that the outcome of the game satisfies p 2 . While this rule is often reasonable, it is fundamentally plagued 
by the following problem: instead of trying to satisfy pi, player 1 could try to falsify p 2 , see e.g. [3]. This 
problem disappears if player 2 has a winning strategy to enforce p 2 , and the rule is then safe. We come back 
to that later in the paper (see Lemma 1). 

Chatterjee et al. in [6] proposed synthesis rules inspired by Win-under-Hyp but avoid the aforementioned 
problem. The rule was originally proposed in a model with two components and a scheduler. We study here 
two natural extensions for n players. 

Rules AG^ and AG^: The objectives are achieved by 

(AG^) assume-guarantee-A if there exists a strategy profile ap such that 

1. G, ap \= pi, 2. for all players i, G, ai \= if\j(zp\{i} (pj) ^ </i- 
(AG^) assume- guarantee-\^ if there exists a strategy profile ap such that 

1. G, ap \= pi, 2. for all players i, G, ai \= (Vje-p\{i} 4'i) ^ 4'i- 

^This rule was introduced in [3, under the name Doomsday equilibria, as a generalization of the AG rule of [6] to the case of 
n-players. 
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The two rules differ in the second requirement: AG^ requires that player i wins whenever all the other 
players win, while AG^ requires player i to win whenever one of the other player wins. Clearly AG^ is 
stronger, and the two rules are equivalent for two-player games. As shown in [S], for two-player games, 
a profile of strategy for AG^ (or AG'^) is a Nash equilibrium in a derived game where players want, in 
lexicographic order, first to satisfy their own objectives, and then as a secondary objective, want to falsify 
the objectives of the other players. As NE, AG" and AG'^ require players to synchronize on a particular 
strategy profiles. As we will see, this is not the case for the new rule that we propose. 

m and m introduce two versions of rational synthesis (RS). In the two cases, one of the player, say 
player 1, models the system while the other players model the environment. The existential version (RS^) 
searches for a strategy for the system, and a profile of strategies for the environment, such that the objective 
of the system is satisfied, and the profile for the environment is stable according to a solution concept which 
is either NE, SPE, or Dom. The universal version (RS'^) searches for a strategy for the system, such that 
for all environment strategy profiles that are stable according to the solution concept, the objective of the 
system holds. We write resp. strategy proHles (t_i = (cr 2 , ..., cr„) that are 

NE (resp. SPE) equilibria in the game G when player 1 plays CTi, and for the set of strategy profiles 

(T_i where each strategy aj, 2 < j < n, is dominant in the game G when player 1 plays cti. 

Rules RS^’^(NE,SPE,Dom): Let 7 G {NE, SPE, Dom}, the objective is achieved by: 

(RS^( 7 )) existential rational synthesis under 7 if there is a strategy Ui of player 1 , and a profile (t_i G , 
such that G, tJi, a_i ^ (j>i. 

(RS^( 7 )) universal rational synthesis under 7 if there is a strategy a\ of player 1 , such that 
for all (T_i G G, tri, cr_i ^ (j)i. 

Clearly, (RS'^( 7 )) is stronger than (RS^( 7 )) and more robust. As RS^’'^(NE, SPE) are derived from NE and 
SPE, they require players to synchronize on particular strategy profiles. 

Novel rule We now present our novel rule based on the notion of admissible strategies. 

Rule AA: The objectives are achieved by assume-admissible (/\A) strategies if there is a strategy prohle cr-p 
such that: 1. for all i gP, G Adm,i(G); 2. for all i gP, G Adm_i(G). G,cr(_-,cri \= (j)^. 

A player-i strategy satisfying conditions 1 and 2 above is called assume-admissible-winning (AA-winning). 
A profile of AA-winning strategies is an AA-winning strategy profile. The rule AA requires that each player 
has a strategy winning against admissible strategies of other players. So we assume that players do not play 
strategies which are dominated, which is reasonable as dominated strategies are clearly suboptimal options. 

Contrary to Coop, AG^, and AG'^, AA does not require that the strategy profile is winning for each player. 
As for Win, this is a consequence of the definition: 

Theorem 1. For all AA-winning strategy profile a-p, G,ap |= 

The condition that AA strategies are admissible is necessary for Thm. [T] it does not suffice to have 
strategies that are winning against admissible strategies (see Appendix). 

4 Synthesis Rules at the Light of an Example 

We illustrate the synthesis rules on an example of a real-time scheduler with two tasks. The system is 
composed of Sched (player 1) and Env (player 2). Env chooses the truth value for ri,r 2 (n is a request for 
task i), and Sched controls qi, q 2 {qi means that task i has been scheduled). Our model is a turn-based game: 
first, Env chooses a value for ri,r 2 , then in the next round Sched chooses a value for qi,q 2 , and we repeat 
forever. The requirements for Sched and Env are as follows: 1. Sched is not allowed to schedule the two tasks 
at the same time. When ri is true, then task 1 must be scheduled (gi) within three rounds. When r 2 is true, 
task 2 must be scheduled ( 92 ) in exactly three rounds. 2. Whenever Env issues then it does not issue this 
request again before the occurrence of the grant g^. Env issues infinitely many requests ri and r 2 . We say 
that a request is pending whenever the corresponding grant has not yet been issued. Those requirements 
can be expressed in LTL as follows: 
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• 0Sched = G{ri X 91 V XXXgi) A G(r 2 XXXga) A G^(gi A 92 )- 

• 4>Em = G(ri ^ X(-'riWq'i)) A G(r 2 ^ 7.{^r2^q2)) A (GFri) A (GFr 2 ). 

A solution compatible with the previous rules in the literatnre. First, we note that there is no 
winning strategy neither for Sched, nor for Env. In fact, first let di be the strategy of Sched that never 
schedules any of the two tasks, i.e. leaves qi and q 2 constantly false. This is clearly forcing against all 

strategies of Env. Second, let d 2 be s.t. Env always requests the scheduling of both task 1 and task 2, i.e. 
ri and r 2 are constantly true. It is easy to see that this enforces -'(/'sched against any strategy of Sched. So, 
there is no solution with rule Wird. But clearly those strategies are also not compatible with the objectives 
of the respective players, so this leaves the possibility to apply successfully the other rules. We now consider 
a strategy profile which is a solution for all the rules except for AA. 

Let (cri,(J 2 ) be strategies for player 1 and 2 respectively, such that the outcome of {ai,a 2 ) is ”Env emits 
ri, then Sched emits gi, Env emits r 2 , then Sched waits one round and emits 52 , and repeat.” If a deviation 
from this exact execution is observed, then the two players switch to strategies di and (72 respectively, i.e. to 
the strategies that falsify the specification of the other players. The reader can now convince himself/herself 
that (cti, (J 2 ) is a solution for Coop, AG and RS^(NE, SPE, Dom). Furthermore, we claim that ai is a solution 
for Win-under-Hyp and RS^(NE, SPE, Dom)0 But, assume now that Env is a device driver which requests 
the scheduling of tasks by the scheduler of the kernel of an OS. Clearly ((Ti,(T 2 ), which is compatible with 
all the previous rules (but Win), makes little sense in this context. On the other hand, ())sched and ^Env are 
natural specifications for such a system. So, there is clearly room for other synthesis rules! 

Solutions provided by AA, our novel rule. For Env, we claim that the set of admissible strategies, 
noted Adm(^Env), are exactly those that (i) do not emit a new request before the previous one has been 
acknowledged, and (ii) do always eventually emit a (new) request when the previous one has been granted. 
Indeed as we have seen above, Env and Sched can cooperate to satisfy (psched A ^Env, so any strategy of Env 
which would imply the falsification of (j)Env is dominated and so it is not admissible. Also, we have seen that 
Env does not have a winning strategy for ^Env, so Env cannot do better. 

Now, let us consider the following strategy for Sched. (i) if pending requests ri and r 2 were made one 
round ago, then grant gi; if pending requests ri and r 2 were made three rounds ago, then behave arbitrarily 
(it is no more possible to satisfy the specihcation); (ii) if pending request r 2 was made three rounds ago, but 
not ri, then grant g 2 ; (Hi) if pending ri was made three rounds ago, but not r 2 , then grant gi. We claim 
that this strategy is admissible and while it is not winning against all possible strategies of Env, it is winning 
against all admissible strategies of Env. So, this strategy enforces (()sched against all reasonable strategies of 
Env w.r.t. to his/her own objective (pEm- In fact, there is a whole set of such strategies for Sched, noted 
WinAdmsched- Similarly, there is a whole set of strategies for Env which are both admissible and winning 
against the admissible strategies of Sched, noted WinAdmEnv- We prove in the next section that the solutions 
to AA are rectangular sets', they are exactly the solutions in WinAdmsched x WinAdmEnv This ensures that 
AA leads to resilient solutions: players do not need to synchronize with the other players on a particular 
strategy profile but they can arbitrarily choose inside their sets of strategies that are admissible and winning 
against the admissible strategies of the other players. 

5 Comparison of Synthesis Rules 

In this section, we compare the synthesis rules to understand which ones yield solutions more often, and to 
assess their robustness. Some relations are easy to establish; for instance, rules Win, AG'^, AG^, AA imply 
Goop by definition (and Thm. [TJ. We summarize the implication relations between the rules in Fig. [TJ We 
present the rules AG^, AG^, and the variants of RS'(-) in one group, respectively. A dashed arrow from A to 
B means that rule A implies some rule in B; while a plain arrow means that A implies all rules in B (e.g. AA 

^Also, it is easy to see that Env cioes not have a (dominant strategy for his specification ((details are in Appendix). So, 
considering dominant strategies as best-effort strategies would not lead to a solution for this example. To find a solution, we 
need to take into account the objectives of the other players. 

®The interested reader can find a detailed explanation for all those claims in the appendix. 
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Win 


^ RS^’'^(nE, SPE.Dom) 

1 \ ; 

AA ^ AG^’^ ^ Coop 


Figure 1: Comparison of synthesis rules. 


implies AG^ but not AG'^; while Win implies both rules.) An absence of path means that A does not imply 
any variant of B. Thus the figure explains which approaches yield solutions more often, by abstracting away 
the precise variants. The following theorem states the correctness of our diagram; the appendix contains 
detailed proofs between all pairs of rules. 

Theorem 2. The implication relations of Fig. [7] hold. 

In the controller synthesis framework using two-player games between a controller and its environment, 
some works advocate the use of environment objectives which the environment can guarantee against any 
controller [7]. Under this assumption, Win-under-Hyp implies AA; 

Lemma 1. Let G = be a two-player game. If player 2 has a winning strategy for (j )2 and Win- 

under-Hyp has a solution, then AA has a solution. 

We now consider the robustness of the profiles synthesized using the above rules. An AA-winning strategy 
profile (T-p is robust in the following sense: The set of AA-winning profiles is rectangular, i. e. any combination 
of AA-winning strategies independently chosen for each player, is an AA-winning profile. Second, if one 
replaces any subset of strategies in AA-winning profile a-p by arbitrary admissible strategies, the objectives 
of all the other players still hold. Formally, a rectangular set of strategy profiles is a set that is a Cartesian 
product of sets of strategies, given for each player. A synthesis rule is rectangular if the set of strategy profiles 
satisfying the rule is rectangular. The RS rules require a specific definition since player 1 has a particular 
role: we say that RS'^’^( 7 ) is rectangular if given for any strategy cti witnessing the rule, the set of strategy 
profiles { 02 , ■. ■, <Jn) S such that G, cti, ..., cr„ \= 4>i is rectangular. We show that apart from AA, only 

Win and RS'^(Dom) are rectangular among the other rules: 

Theorem 3. We have 1. Rule AA is rectangular; and for all games G, AA-winning strategy profile ap, coali¬ 
tion C QV, if (Tq G Admc(G), then G,a-c,<y'c 1= f\ie-c rules Win and RS'^(Dom) are rectangu¬ 
lar; the rules Goop, AG'^, AG^, 

RS^(NE,SPE,Dom), and RS'^(NE, SPE) are not rectangular. 

6 Algorithm for Assume-Admissible 

In this section, we recall the characterization of the outcomes 
of admissible strategy profiles given in [4], and we derive al¬ 
gorithms for the A A rule. We use the game of Fig. [2] as a 
running example for this section. Clearly, none of the play¬ 
ers of this game has a winning strategy for his own objec¬ 
tive when not taking into account the objective of the other 
player, but, as we will see, both players have an admissible 
and winning strategy against the admissible strategies of the 
other player, and so the A A rule applies. 

The notion of value associated to the states of a game 
plays an important role in the characterization of admissible 
strategies and their outcomes [311]. Fix a game G. A state s 


Synthesis 



Figure 2: Game G with two players V = 
{1, 2}. Player 1 controls the round states, and 
has objective GFs 2 , and player 2 controls the 
square state and has objective GFsi. 
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has value 1 for player i, written Vali(s) = 1, if player i has a winning strategy from s; Vali(s) = —1 if for all 
strategy profiles a-p € E-p, Outc, s(ct-p) does not satisfy ij)i] and otherwise Vali(s) = 0. A player j decreases 
its own value in history h if there is a position k such that \/a\j{hk) > Valj(hfc+i) and hk G Sj. We proved in 
[1], that admissible strategies do not decrease their own values. Let us call such strategies value-preserving. 
In fact, if the current state has value 1, there is a winning strategy which stays within the winning region; 
if the value is 0, then although other players may force the play into states of value — 1 , a good strategy for 
player i will not do this by itself. 

Lemma 2 ([4l Lem. 1]). For all games G, players i, and histories p, if last(p) S Si and ai € Adrrii then 
Vali((5(last(p), ai{p))) = Vali(last(p)). 

For player i, let us define the sets Vi^x = {s \ Vali(s) = x} for x G { — 1,0,1}, which partition S. We define 
the set of value-preserving edges for player i as Ei = {(s,a) G S x Act | s G Si => Vali((5(s,a)) = Vali(s)}. 
Observe that value-preserving strategies for player i are exactly those respecting Ei. In our running example 
of Fig. [21 it should be clear that any strategy that chooses a transition that goes to S 3 is not admissible nor 
for Player 1 neither for Player 2, as by making this choice both players are condemned to lose their own 
objective while their other choices leave a chance to win; so the choice of going to S 3 would decrease their 
own value. So, we can already conclude that Player 2 always chooses S 2 1 -^ si, his only admissible strategy. 

Not all value-preserving strategies are admissible: for Muller objectives, staying inside the winning region 
does not imply the objective. Moreover, in states of value 0, admissible strategies must visit states where 
other players can “help” satisfy the objective. Formally, help states for player i are other players’ states with 
value 0 and at least two different successors of value 0 or I. Let Hi = {s G S\Si \ Vali(s) = 0 Ads' 7 ^ s". s' G 
(5(s, Act) A s" G (5(s, Act) A Vali(s') > 0 A Vali(s") > 0}. Given this, the following lemma, adapted from [4], 
characterizes the outcomes of admissible strategies. We denote by G{Ei) the set of runs that respect Ei, i.e. 

Lemma 3. For all games G, and players i, Outc fl <I>i = OutG(Admi, E_i), where = G{Ei) A (GF(Fi_i) ^ 
</>,) A(GF(I4,o) ^(/),VGF(iJ,)). 

In our running example of Fig.|21 a strategy of Player I which, after some point, always chooses si 1 —si is 
dominated by strategies that chose infinitely often si 1 —S 2 . This is a corollary of the lemma above. Indeed, 
while all those strategies only visit states with value 0 (and so do not decrease the value for Player 1), the 
strategy that always chooses Si 1 —>• si has an outcome which is loosing for Player I while the other strategies 
are compatible with outcomes that are winning for Player I. So, outcome of admissible strategies for 
Player I that always visit states with values 0, also visits S 2 infinitely often. Using the fact that strategies 
are value-preserving and the last observation, we can now conclude that both players have (admissible) 
winning strategies against the admissible strategies of the other players. For instance when Player I always 
chooses to play si >->■ S 2 , he wins against the admissible strategies of Player 2. 

Note that can be decomposed into a safety condition Si = G{Ei) and a prefix independent condition 
Mi = (GF(U,i) A (GF(U_o) if'i V GF{Hi)) which can be expressed by a Muller condition described 

by a circuit of polynomial size. 

For player i, we let Sti = OutG(Admi) A (OutG(Adm_i) (pi), which describes the outcomes of admissible 
strategies of player i, which satisfy objective (pi under the hypothesis that they are compatible with other 
players’ admissible strategies. In fact, it follows from [4] that captures the outcomes of AA-winning 
strategies for player i. 

Lemma 4. A player i strategy is AA-winning iff it is winning for objective Qi. 

Objective Sti is not directly expressible as a Muller condition, since <I>i and /\j contain safety parts. 
Nevertheless, the information whether G{Ei), or G{\Jj^iEj) has been violated can be encoded in the state 
space. Formally, for each player i, we define game G' by taking the product of G with {T,0,_L}; that is, 
the states are S x {T,0,_L}, and the initial state (smit)O). The transitions are defined as for G for the first 
component; while from state (s, 0), any action a outside Ei leads to (d(s, a), T), and any action a outside Ej, 
j 7 ^ i, leads to (d(s, a),T). The second component is absorbing at T,T. We now rewrite the condition 


for G' as = (GF(S x {0}) A M[ A {Aj^iMj =^> ^9) V (GF(S x {T}) A M'), where M' is the set of runs of G' 
whose projections to G are in Mi, and similarly for 0'. 

Now, checking AA-synthesis is reduced to solving games with Muller conditions. Moreover, we also obtain 
a polynomial-time algorithm when all objectives are Biichi conditions, by showing that is expressible by 
a parity condition with four colors. 

Theorem 4. AA-synthesis in multiplayer games is PSPAC^-complete, and P-complete for Biichi objectives. 
Player i wins for objective Vli in G iff he wins for objective fl' in G'. 


7 Abstraction 

We present abstraction techniques to compute assume-admissible strategy profiles following the abstract 
interpretation framework m ; see m for games. Abstraction is a crucial feature for scalability in practice, 
and we show here that the AA rule is amenable to abstraction techniques. The problem is not directly 
reducible to computing AA-winning strategies in abstract games obtained as e.g. in |13j : in fact, it can 
be easily seen that the set of admissible strategies of an abstract game is incomparable with those of the 
concrete game in general. 

Overview. Informally, to compute an AA-winning strategy for player i, we construct an abstract game A'i 
with objective ffi s.t. winning strategies of player i in A'i map to AA-winning strategies in G. To define A', 
we re-visit the steps of the algorithm of Section [5] by defining approximations computed on the abstract state 
space. More precisely, we show how to compute under- and over-approximations of the sets Vx,k, namely 
and Vx,ki using fixpoint computations on the abstract state space only. We then use these sets to define 
approximations of the value preserving edges and Ek) and those of the help states (iJj. and Hk). These 
are then combined to define objective s.t. if player k wins the abstract game for then he wins the 
original game for flj., and thus has an AA-winning strategy. 

Abstract Games. Consider G = {A, {(j)i)i^-p) with A = {'P,{Si)i^'p,s\„\t,{Acti)i^-p,6) where each (ji is a 
Muller objective given by a family of sets of states Let = 1+Jjgp denote a finite set, namely 

the abstract state space. A concretization function 7 : i—>■ 2^ is a function such that: 1. the abstract states 
partitions the state space: l+J^agga 7 (s^) = S, 2 . it is compatible with players’ states: for all players i and 
s^ G Si, 7 ( 5 ^) C Si. We define the corresponding abstraction function a : S where a{s) is the unique 

state s.t. s G 7 (s^). We also extend a, 7 naturally to sets of states; and to histories, by replacing each 
element of the sequence by its image. 

We further assume that 7 is compatible with all objectives J-t in the sense that the abstraction of a set S 
is sufficient to determine whether S G J-p. for all i £ P, for all S,S' C S with a{S) = a{S'), we have 
S G Ti S' G J-i. If the objective 4>i is given by a circuit, then the circuit for the corresponding abstract 
objective (fl is obtained by replacing each input on state s by a{s). We thus have p G 4>i if, and only if, 
a{p) G 

The abstract transition relation A® induced by 7 is defined by: {s^,a,P) £ A® 3s £ 7 ( 3 ^), 3t £ 
'y{P),t = 6{s,a). We write post^(s^,a) = {P G | A{s^,a,P)}, and post^(s^. Act) = UaeActPOst^(s^, a). 
For each coalition C C V, we define a game in which players C play together against coalition — C; 
and the former resolves non-determinism in A^. Intuitively, the winning region for C in this abstract 
game will be an over-approximation of the original winning region. Given C, the abstract arena A'^ is 
{{C, -C}, (Sc, S_c), a(sinit), 

(Actc, Act-c),(5"’'^), whereSc = (U*6C S")u(Ue-p S? x Acl),S-c = Ui^c S"; and Acte = (U*ecAcL)uS" 

and Act_c = Uie-c^*-^*- relation is given by: if s^ £ S^, then 5^'^{s^,a) = (s^,a). If 

(s^,a) £ X Act and P G satisfies {s^,a,P) G A^ then ((s^, a), = P] while for {s^,a,P) ^ A^, the 

play leads to an arbitrarily chosen state with A(s^,a,u^). Thus, from states (s^,a), coalition C chooses 
a successor P. 

We extend 7 to histories of A^ by first removing states of (S- x Acti); and extend a by inserting these 
intermediate states. Given a strategy a of player k in A^, we define its concretization as the strategy 7 ( 0 ") 
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of G that, at any history h of G, plays 7 (cr)(/i) = a{a{hj). We write for the states of S® from 

which the coalition D has a winning strategy in for objective (j)%, with D € {C,—C}. Informally, it 
is easier for coalition C to achieve an objective in AP than in G, that is, \N\nc{A‘^ over-approximates 
Winc(A, (j)k): 

Lemma 5. If the coalition C has a winning strategy for objective ifk in G from s then it has a winning 
strategy for 0 ^ in A^ from a(s). 

Value-Preserving Strategies. We now provide under- and over-approximations for value-preserving 
strategies for a given player. We start by computing approximations ^ and Vk,x of the sets Vk,x, and then 
use these to obtain approximations of the value-preserving edges Ek- Fix a game G, and a player k. Let us 
define the controllable predecessors for player k as GPRE^p\{fc} ki^) = {s^ G | 3a £ Actfc, post^(s^, a) C 
X} U {s® G ^v\{k} I ^ Act_fc, post^(s^, a) C X}. We let 

Vkp = \N\nr\{k}{A^\^AnWinriA'P 

V_k 0 = i^V.(CPRE^i 3 \{fc} k{X ^V_k 1 U Vj, _ 2 ^) n f), 

where F = Winp\{fe}) n Winp(.4®, 

The last definition uses the vX.f{X) operator which is the greatest fixpoint of /. These sets define approxi¬ 
mations of the sets 14,x- Informally, this follows from the fact that to define e.g. Vk,i, we use the game A^A ^ 
where player k resolves itself the non-determinism, and thus has more power than in G. In contrast, for V_k i, 
we solve APAA where the adversary resolves non-determinism. We state these properties formally: 

Lemma 6. For all players k and x G {—1,0,1}, 'y{V_i.p C I4,x C 'y{Vk,x)- 

We thus have Ux 7 (V/c,x) = S (as UxFfe.x = S) but this is not the case for so let us define V_ = 

We now define approximations of Ek based on the above sets. 

Ek = {(s4a) G S" X Act I G ^ 3a:, G Ffc,x, post^(s4 a) r\Ui>xVk,i p 0}, 

Ek = {{A, a) G 5=' X Act | G ^ 3a:, G Fj, post^(s4 a) C Ui>xFfe J U {(s4 a) | s^ ^ Fj. 

Intuitively, Ek is an over-approximation of Ek, and Fj, under-approximates Ek when restricted to states 
in F (notice that contains all actions from states outside F). In fact, our under-approximation will be 
valid only inside F; but we will require the initial state to be in this set, and make sure the play stays 
within F. We show that sets F^, and Ek provide approximations of value-preserving strategies. 

Lemma 7. For all games G, and players k, Stratfc(Ffc) C 7 (Stratfc(Ffe)), and if Sinit G 7 (F), then 0 7 ^ 
7 (Stratfe(Fj,)) C Stratfe(Ffe). 

Abstract Synthesis of AA-winning strategies. We now describe the computation of AA-winning strate¬ 
gies in abstract games. Consider game G and assume sets E_^,Ei are computed for all players i. Roughly, 
to compute a strategy for player k, we will constrain him to play only edges from Fj,, while other players j 
will play in Ej. By Lemma 0 any strategy of player k maps to value-preserving strategies in the original 
game, and all value-preserving strategies for other players are still present. We now formalize this idea, 
incorporating the help states in the abstraction. 

We fix a player k. We construct an abstract game in which winning for player k implies that player k has an 
effective AA-winning strategy in G. We also define A} = ({{fc}, —k}, (S'^, S'!_j.US'^ x Act), a(sinit), (Act^, Act_fe), ( 5 _ 4 fc), 
where S'^ — 5^ x (T, 0, T}; thus we modify A^AA by taking the product of the state space with {T, 0, Tj. 
Intuitively, as in Section [51 initially the second component is 0, meaning that no player has violated the 
value-preserving edges. The component becomes T whenever player k plays an action outside of F^; and T 
if another player j plays outside Ej. We extend 7 to A} by 'y{{s^,x)) = 7 ( 5 ^) x (xj, and extend it to 


Ffe,_i =Win0(A®,-</>!), 
Vk,-i=WMA^,^rk) 
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histories of by first removing the intermediate states S'^ x Act. We thus see A!f. as an abstraction of A' 
of Section [6] 

In order to define the objective of let us first define approximations of the help states Hk, where we 
write A(s^,Act,t^) to mean 3a G Act, A(s^, a, 

Hk = {s^ G Vk,o \ Sfe I G Vk,o U Vk,i- A(s^ Act,t^) A A(s^ Act, u^)} 

Kk = {s" G Vk,o \ S| I 3a 7 ^ 5 G Act, post^(s", a) n post^(s=, &) = 0, 

Post^(s",a) U post^(s%6) C UZfc,i}- 

We define the following approximations of the objectives M'f. and in A'/.. 

M'fe = (GF(Ffc.i) ^ </>!) A (GF(Ffc,o) ^ (</>! V GF(^,))) , 

M'k = (GF(Ffc,i) ^ </>!) A (GF(Z,.^o) V GFiHk))) , 

^k = (gf(S= X {0}) am:, a M', => V (GF(S^ X {T}) A M:,) . 

Theorem 5. For all games G, and players k, if Sinit G y_, and player k has a winning strategy in A'j^ for 
objective fl'k, then he has a winning strategy in G^, for 17,; and thus a AA-winning strategy in G. 

Now, if Theorem [S] succeeds to find an AA-winning strategy for each player fc, then the resulting strategy 
profile is AA-winning. 


8 Conclusion 

In this paper, we have introduced a novel synthesis rule, called the assume admissible synthesis, for the 
synthesis of strategies in non-zero sum n players games played on graphs with omega-regular objectives. 
We use the notion of admissible strategy, a classical concept from game theory, to take into account the 
objectives of the other players when looking for winning strategy of one player. We have compared our 
approach with other approaches such as assume guarantee synthesis and rational synthesis that target the 
similar scientific objectives. We have developed worst-case optimal algorithms to handle our synthesis rule 
as well as dedicated abstraction techniques. As future works, we plan to develop a tool prototype to support 
our assume admissible synthesis rule. 
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A Complements on Synthesis Rules (Section [3]) 

We give the proof of Theorem [TJ 

Theorem 1. For all AA-winning strategy profile a-p, G,(T-p |= 

Proof. Let ap be a strategy profile witness of AA. Let z be a player, we have that a-i € Adm_i(G), because 
by condition 1, for all j i, aj € Admj(G). Then by condition 2 we have that G, ap j= (pi. Since this is true 
for all players i, we have that G, ap ^ Ai^p A- ^ 

The following example shows that AA-winning strategies must be admissible themselves for Thm. [T] to 
hold. 



Figure 3: Illustration of the necessity of Condition 1 in the definition of Assume-Admissible Synthesis. 
Player 1 controls circles and player 2 squares. 


Example 1. In AA, the profile of strategy must be composed of admissible strategies only. This is necessary 
otherwise the players may make assumptions on each other which are not satisfied. This is illustrated by 
the example of Figure 0 in which two players have reachability objectives (pi and (p 2 respectively. The states 
labeled by (pi are the target states for the corresponding objective. Admissible strategies are shown in thick 
edges. Now, the player 2 strategy that chooses the dashed edge from S 2 satisfies Condition 2 of AA, since 
S 2 is not reachable under admissible strategies of player 1. Similarly, the player 1 strategy that chooses the 
dashed edge from si satisfies Condition 2 of AA since the thick edges lead back to a state satisfying (pi. But 
then the resulting profile is such that none of the two players win. 

We explain why the pair of strategies (ui, 172 ) is a solution for the synthesis rules Coop, AG, RS^(NE, SPE): 

• Rule Coop: the outcome of ((Ti,CT 2 ) is ”Env emits ri, then Sched emits qi, Env emits r 2 , then Sched 
waits one round and emits q 2 , and repeat.” This outcome is a model for both </)Sched and ^^Env 

• Rules AG^’^: the two rules coincide as we have only two players. When both players follow {ai,a 2 ), 
we know that the outcome is a model for both (/>sched and (pEm- We must in addition verify that 
Out(CTi) ^ pEnv —^ (/'Sched and that Out((T 2 ) H ^^Sched —^ </>Env This is the case as either the outcome 
of the game between the two players is ”Env emits ri, then Sched emits qi, Env emits r 2 , then Sched 
waits one round and emits q 2 , and repeat”, or it is not but then player 1 switches to strategy di and 
this forces outcomes that falsify ^^Env The argument is similar to justify that Out(CT 2 ) H (/(Sched —^ ^^'Env 

• Rules RS^(NE, SPE, Dom): for the existential version of rational synthesis, we consider that Sched 
(player 1) plays the role of the system. We must then prove that (cti, (T 2 ) is such that (/)sched is satisfied 
and CT 2 is stable for NE,SPE, and Dorn. This is the case because Out(cri, 0 - 2 ) is a model for (/)sched, so it 
satisfies the specification of the scheduler. Furthermore, it satisfies the specification of the environment, 
and so the environment has no incentive to deviate: CT 2 is stable for NE. It is also stable for SPE as 
any deviation of the environment triggers strategy di for the scheduler and d 2 for the environment, 
as a consequence, there is no incentive for environment to deviate from d 2 as Out(di) ^ ~'(pEnv It is 
stable for Dorn because 172 is winning under ai. 
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We explain why the pair of strategies cji is a solution for the rules Win-under-Hyp and RS'^(NE, SPE): 

• Rule Win-under-Hyp; we have seen above that Out(cri) |= (/)Env — t </'Sched and so cti is a solution for 
Win-under-Hyp. 

• Rules RS'^(NE,SPE, Dorn): against tri, the only strategies for the environment that are stable for NE, 
SPE, Dorn are those that make sure that the outcome is equal to ”Env emits ri, then Sched emits gi, 
Env emits r 2 , then Sched waits one round and emits 92 , and repeat.” Indeed, any strategy that would 
deviate from that outcome would trigger the execution of &i for the scheduler and the outcome would 
then be losing for the environment making a deviation toward a 2 profitable for instance. 

Absence of dominant strategies Let us now convince the reader that Env has no dominant strategy 
for his specification. Indeed, it is easy to see that if there are dominant strategies then all admissible strategies 
must be dominant. Now, take two strategies in Adm(^Env) that behaves the same with the exception that 
the first strategy when a request is granted is true), directly issue another request for i £ {1,2}, while 
the second strategy waits one additional turn to issue the new request r^. It is easy to see that none of the 
two strategies dominate the other one, as the Sched can adapt his own strategy to make the first strategy 
win and the second loose, and the other way around. 


B Comparison of the Synthesis Rules (Section [5]) 

In this section, we detail the comparison of the synthesis notions we consider. We adopt the following 
notation: an implication A => B between two notions A and B means that B has a solution whenever A has 
one. Note that this does not always imply inclusion between the witnessing strategy profiles. The proof of 
Theorem [21 will be decomposed in several lemmas. 


B.l Proof of Theorem [2] 

Remark 1. We have RS^(SPE) ^ RS^(NE) and RS'^(NE) => RS'^(SPE) because any subgame perfect equi¬ 
librium is also a Nash equilibrium. Moreover, in the definition of the rules RS, the conditions for RS'^ are 
stronger than for RS^, so RS''(SPE) ^ RS^(SPE), RS'^(NE) ^ RS^(NE) and RS^(Dom) ^ RS^(Dom). 

Lemma 8. Win AA ^ Coop RS^(SPE) and Win AG^ AG^ ^ Coop, 


Proof. Win A A This holds because winning strategies are always admissible [2], therefore a profile witness 

of Win satisfies c ondition 1 and 2 of the definition of assume-admissible. 

A A =>- Coop This holds by Theorem [TJ 

Coop => RS^(SPE) Note that in order for RS to make sense we must have sys £ V. Assume Coop has a 


solution and let a-p be a profile of strategy such that for all player i, ap ^ (fi. 

We define a strategy profile cr', that follows the path p — OutG(cri) when possible (that is: if /i is a 
prefix of p then play act|ft|(p)) and if not follows a subgame perfect equilibrium: that is, we select for each 
state s a subgame perfect equilibrium af,, there always exist one for Borel games (so in particular for Muller 
games) [291 Theorem 3.15]; then if h is not a prefix of p, let j be the last index such that h<j = p<j and we 
define a'.p{h) = {h>j+i). 

Let A be a history. If h is a prefix of p then the objective of each player is satisfied by following cr' o h so 
none of them can gain by changing its strategy, therefore it is a Nash equilibrium from last(/i). If h is not 
a prefix of p then by definition of ct', players follow a subgame-perfect equilibrium since h deviated from p, 
so in particular ct' o /i is a Nash equilibrium from last(/i). Moreover the objective of the system is satisfied. 
Therefore ap is a solution to RS(SPE). 


Win ^ AG'' 


Let Up such that for each player i, Oi is winning for fii. The first condition in the definition 
of AG'' is satisfied because for all player i, OutG(cr-p) satisfies fii. The second condition is satisfied because 


15 











for all strategy a'_^, we have that OutG((Ti, tr^j) satisfies (pi, so in particular it satisfies (Vjg-p\{i} 'Pj Pi)- 
Hence a-p is a solution for AG^. 

This holds because the second condition in the definition of these rules is stronger for AG^. 
This implication holds simply because of the condition 1 in the definition of assume- 
guarantee, which corresponds to the definition of Cooperative synthesis. 

□ 


AG^ ^ AG^ 
AG^ Coop 


Lemma 9. For all 7 e {NE, SPE, Dorn}, Win ^ RS^( 7 ), RS''(Dom) ^ RS'^(NE) and RS^(Dom) =7 
RS^(SPE). 


Proof. 


Win =7 RS^( 7 ) 


Let ap be a strategy profile such that for each player i, ai is winning for pi. 


We first show that is not empty. For 7 G {NE, SPE} this is because there always exist a subgame 

perfect equilibrium for Borel games (so in particular for Muller games) [5^1 Theorem 3.15] and a subgame 
perfect equilibrium is a Nash equilibrium. For 7 = Dorn, note that by definition of dominant strategies, 
winning strategies are dominant, so contains at least tT_i. 

Let be a strategy profile for T" \ {1}. Since cti is a winning we have that \= pi. Therefore 

(Ti is a solution for RS'^( 7 ). 


Let ap be a witness for RS^(Dom). We define a strategy profile alp such that 
a[ follows Ui on all histories compatible with ai (that is if h prefix of p G OutG(cri) then a^(h) = ai{h)) and 
outside of these histories follows a subgame perfect equilibria: there always exist one for Borel games (so in 
particular for Muller games) [551 Theorem 3.15]. 

By definition of alp, the outcome OutG(cr)r,) is the same than OutG(cr'p). Because ap is a witness for 
RS^(Dom), this outcome is winning for player 1. 

It remains to show that a'_i is a subgame perfect equilibria. Let A be a history, i be a player different 
from player 1, and a'f be a strategy for player i. We show that from h player i does not improve by switching 
from al to another strategy a'f , which will show that a'p o A is a Nash equilibrium from h. 

If h is compatible with ai then tr' coincide with ai from this history, so OutG(cr', cr_i) = OutG(crp). Since 
ai is a dominant strategy, if G, a'/, a-i |= pi then G, ai, a-i ^ pi and therefore this implies that OutG(cr', a-i) 
satisfy pi. This means that i does not improve by switching from cr' to a'f. 

If h is not compatible with ai, then a[ plays according to a subgame-perfect equilibria since the first 
deviation. In particular, this strategy is a Nash equilibrium from h. 

This shows that a'_i is a subgame perfect equilibrium and has a'p ^ pi, this is a witness for RS^(SPE). 


RS^(Dom) ^ RS^(SPE) 


RS'^(Dom) ^ RS'^(NE) 


Consider the example given in Figure 0] The strategy r for player 2 is dominant 


and any strategy of player 3 is dominant. The outcome of these strategies always go to the bottom state 
where psys is satisfied. Therefore there is a solution to RS'^(Dom). 

However, we show that there is no solution to RS'^(NE). Consider the strategy profile {■,l,b), this is a 
Nash equilibrium (even a subgame Nash equilibrium) since no player can improve his/her strategy. Note 
that player I is losing for that profile, hence no strategy of player I can ensure that it will win for all Nash 
equilibria. □ 


In the example of Section |4j we saw that more strategy profiles satisfied the assume-guarantee condition 
compared to assume-admissibility, including undesirable strategy profiles. We show that the rule AG^ is 
indeed more often satisfied than AA; while the rules AG^, and A A are incomparable. 

Lemma 10. We have AG'^ ^ AA; AG'^ ^ AA; AA ^ AG'' and AA ^ AG". 


Proof. AG" 7 ^ AA and AG" 7 ^ AA Consider the game represented in Figure |5| In this example, we have 
Ad mi = El. Therefore player 2 has no winning strategy against all admissible strategies of Ad m 2 (in 
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Figure 4: Example showing that RS'^(Dom) ^ RS'^(NE). Player 2 controls circle states, player 3 square 
states and player 1 does not control any state. 



Figure 5: Example showing that AG 7 ^ AA. 


Player 1 controls circle states and player 2 square states. 


particular the strategy of player 1 that plays r, makes player 2 lose). So AA fails. However, we do have AG^ 
by the prohle cti : si M- I, a 2 - S 2 b, S 3 1 -^ c. This profile also satisfies AG'^ which is equivalent to AG^ for 
two player games. 

Consider the example of Figure | 6 l The profile where player 1 and player 2 plays to the right 
is assume-admissible. However there is no solution to assume-guarantee synthesis: if player 1 and player 2 
change their strategies to go to the state labeled </>i,^!) 2 , then the condition \= {cj)i A ^ 2 ) ^ ^^>3 is not 
satisfied. 


AA 7 ^ AG"^ 



Figure 6 : Example showing that AA 7 ^ AG^. Player 1 controls circle states and player 2 square states; 
player 3 does not control any state. 


AA ^ AG'^ 


We will provide a counter-example to show our claim. Note that we need strictly more than 


two players since otherwise AG'^ is equivalent to AG^, and we have just shown that A A implies AG^. 

Consider the game with three players in Fig. [T] Define the following objectives: (j)\ = GF(s 4 V S7), 
(j )2 = GF(s 4 V Se), 03 = true, where 4>i is player Fs objective. These are actually reachability objectives since 
the game ends in absorbing states. 

Now, action b is dominated at states S 2 and S 3 for player 2. Thus player 1 has a AA-winning strategy 
which consists in taking a at si. Player 2 has a winning strategy in the game (taking a at both states). 


17 
























Player 3 has a AA-winning strategy too since actions b are eliminated for player 2. Therefore, there is an 
A A-winning strategy profile which ends in 54 . 

On the other hand, there is no AG^ profile. In fact, player 1 has no winning strategy to ensure (/>2 V (^3 
01 , which is equivalent to 0 i since 03 = true. 


Ill 


001 


011 101 

Figure 7: Example showing that AA ^ AG^. Player 1 controls circle states and player 2 square states; 
player 3 does not control any state. At each absorbing state, the given Boolean vector represents the set of 
players for which the state is winning. 



□ 


Lemma 11. For two player games, AA => AG^. 

Proof. Assume Q is a two player game and consider strategy profile (fTi,cr 2 ) witness of AA. Note that if 
player j decreases its own value at position k then its value for h<k+i will be smaller or equal to 0 which 
means player j has no winning strategy from this history. By determinacy of turn-based zero-sum games, 
player 3 — j has a winning strategy for Therefore we can adjust the strategies (cti, a 2 ) such that if there 
is a player j that decreases its own value, the other player will make it lose. We write {a'^, alfj the strategies 
thus defined and we will show that they form a solution of Assume-Guarantee. 

Let p be the outcome of the strategy profile (crj, cr^)- We can show that p is also the outcome of (cti, cr 2 )- 
First we recall that an admissible strategy does not decrease its own value (Lemma [2]). Therefore each ct' is 
identical to Oi on the run p. By Theorem [U p satisfies 0i A 02. 

Let tr" be an arbitrary strategy profile for 1, and consider p' = OutG(cr", cr^)- We show that p' |= 0i => 02. 
Note that player 2 cannot be the first to decrease its value during p' since it behave according to (72 has long 
has there are no devition, and 172 is admissible and admissible strategies do not decrease their own values. 

• If player 1 decreases its value during p', player 2 will play to make him lose and p' ^ 0i. As a 
consequence p' ^ 0i 02 . 

• Otherwise no player decreases its own value during p'. We assume that p' \= 0i and show that p' ^02. 
Since p' ^ 0i, by Lemma [T51 there is a strategy r" which is admissible and compatible with p'. Since 
p' is an outcome of a' 2 , and of r", we have OutG(T", 0 - 2 ) = p'■ Now, since is admissible and by the 
fact that (72 satisfies the condition 2 of AA, we obtain p' |= 02 , which proves the property. 

We can show the same property replacing the roles of player 1 and player 2, thus showing that the profile is 
solution of AG^. □ 


To prove that there is no implication for the edges that are not in the diagram of Figure [TJ it remains to 
be shown that: 


Lemma 12. AA Win, AG^ Win, Coop y^ AA, Coop ^ AG^, Coop RS^(Dom), and for all 
7 e {NE, SPE, Dorn}, RS^’'^( 7 ) y^ Coop. 


Proof. AA y^ Win 


Towards a contradiction assume A A ^ Win, then since we have Win 
we would have A A ^ AG^ but this contradicts Lemma [TOl 


AG^ (Lemma[8]), 
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AG'^ 7 ^ Win 


Towards a contradiction assume AG^ Win, then since we have Win 
we would have AG^ AA but this contradicts Lemma [TUI 


AA (Lemma [5]), 


Goop 7 ^ AA In Figure [SJ we have an example of a game where there is no solution for AA (see the proof 


of Lemma [TUI for details), however there is a solution for Goop: {I, b). 

Coop ^ AG^ Consider the example of FigureEl There is a solution for Coop: player 1 plays a. However 


there is no solution for AG : player 2 has no strategy to ensure that (pi 


Coop ^ RS^(Dom) Consider the example of FigurelHl This example has a solution for Coop, for instance 


(Z, ac) or (r, bd). However player 2 has no dominant strategy: I looses against bd so it is dominated by r, and 
r looses against ac so it is dominated by 1. Therefore RS^(Dom) has no solution. 




Figure 8 : Example showing that 

Coop tA AG^. Player 1 controls Figure 9: Example showing that Coop 7 ^ 
the circle state. RS^(Dom). Player 2 controls circle states, 

player 3 square states and player 1 does not 
control any state. 



Figure 10: Two-player 
game showing that 
RS^’''( 7 ) 7 ^ Coop. 

Player 2 controls the 
circle state but has no 
choice. 


RS^’''( 7 ) 7 ^ Coop 


Consider the example of Figure [TUI There is no solution for Coop: player 2 can never 


win. However there is a solution for any concept in RS^’'^( 7 ): player 1 wins against any of the strategy 
satisfying these concepts since the only possible outcome is winning for him. 


□ 


This concludes the proof of Theorem [2] 

B.2 Proof of Lemma [1] 

We now give the proof of Lemma [T] 

Lemma 1. Let C = {A,(pi,p 2 ) be a two-player game. If player 2 has a winning strategy for p 2 ctnd Win- 
under-Hyp has a solution, then AA has a solution. 

Proof. Assume that alf is a winning strategy for p 2 and let CTi,cr 2 be a solution of Win-under-Hyp. We 
have that Vct 2 . ci, CT 2 \= (p 2 ^ 4>i and Vtri. <Ji,<J 2 \= Pi => d' 2 • Since ulf is a winning strategy, all admissible 
strategies of player 2 are winning. Then, for all CT 2 S Adm 2 , we have C, iji, (T 2 \= p 2 and because Vcr^j <^ 1,(^2 1= 
p 2 =^ 01 , we also have that G, cti, cr^ H '/'i- If <^1 dominated, there exists a non-dominated strategy cr“ that 
dominates it Thm. 11], otherwise we take cr“ = cti. In both cases af is admissible. As cti is dominated by 
(tJ, G,(Ti,cr 2 \= Pi implies G,cr“,cr 2 |= Pi- This shows that the condition Vcr^ S Adm 2 (G). G,(t“,(T 2 \= Pi is 
satisfied. Since alf is winning, it is admissible and we also have Vcr^ G Admi(G). G,a'i,a^ |= p 2 . Therefore 
all conditions of Assume-Admissible are satisfied by (cr“, alf). 

□ 


We now prove Theorem [3] 


19 















Theorem 3. We have 1. Rule AA is rectangular; and for all games G, AA-winning strategy profile ap, coali¬ 
tion cep, e Admc(G), then G,ct_c,<7^ |= Ai^-c rules Win and RS'^(Dom) are rectangu¬ 
lar; the rules Coop, NCA , AG'^, 

RS^(NE,SPE,Dom), and RS'^(NE, SPE) are not rectangular. 


Proof. AA is rectangular If there is no solution to AA, then the set of witness is empty, and therefore is 


rectangular. If there is only one solution, then it is the Cartesian product of singletons and therefore also a 
rectangular set. 

Otherwise let a-p and af, be two solutions of AA. Let i be a player of V, we show that is also a 

solution of AA. We have that cn £ Adm(G) and for all j yf i, aj £ Adm(G), because condition I holds for 
ap and alp. Therefore condition I holds for ai,a'_^. Similarly, Vcr^_j £ Adm_i(G). G,cr',cri \= (fa and for all 
j yf i, '^a'_j £ Adm_j(G). G,cr',(7j |= (j)j, because condition 2 holds for ap and alp. Therefore condition 2 
holds for ai,a'_^ and it is a witness of A A. 

Let be the set of strategy ai such that there exists (T_i such that ai,a-i is a witness of AA. We can 
show that the set of witness of A A is the Cartesian product of the E““. We obviously have that the set of 
solutions is included in n.cpsr- Let ap be a profile in n.cpsr, and a'p a witness of AA. We can replace 
for one i at a time, the strategy tr' by ai in a'p and by the small property we previously proved, the strategy 
profile stays a solution of AA. Therefore ap is a solution of AA. This shows that the set of solutions is the 
rectangular set Y\i^p 

This claim follows from the definition of AA-winning strategy 




£ Admc(G) ^ G,a-c,cr'c h Aie-C 


profiles, since each strategy is winning against admissible strategies. 


Now Consider any game G and fix a profile ap such that G, ap \= Ai<i<n 4>i- 


another profile solution to Win. Then the strategy tr' ensures 4>i against any strategy profile for —i. If 
we replace at by cr' in the profile ap then the condition for Win are still satisfied. Thus the rule Win is 
rectangular. 

Let CTi be a solution of RS^(Dom). Let 0-2, ..., cr„ and (T2, ..., be profiles of such 

that (Ti,CT2, . .., Un \= 4 >i s-iid CTi, ( 72 ) ■ • ■)\= 4 >i- K we define a profile T2,... ,Tn where each is either ai or 
a'i, then as each is dominant, we have cti, T2,.. ., \= (I>i because ai is a solution of RS'^(Dom). Therefore 

the profile belongs to and makes ui win. This shows that the rule is rectangular. 

Consider the example of Figure [TTJ Since player 2 and player 3 are always winning, all their 
strategies are dominant. There is only one strategy ai for player I since it controls no state. The profiles 
(a,c) and {b,d) are strategies of such that cri wins for (j)i, but the profile {ai,a,d) does not make (f)i 

hold. The rule is therefore not rectangular. 


RSADom) 


RS'^(Dom) 


Win 


Assume ap is solution to Win, then each ai is a winning strategy. Let cr' be a strategy part of 




Figure II: Game with three players showing that 
rule RS^(Dom) is not rectangular. Here, player 1 
controls no state; player 2 controls the square 
state, and player 3 controls the round state. 


Figure 12: Game with three players showing that 
rule RS^(NE) and RS^(SPE) are not rectangular. 
Player I controls no state, player 2 controls the 
square state and player 3 the round state. 


RS'^(NE) and RS'^(SPE) Consider the game represented in Figure fT^ Player I has only one strategy cri 


and the other players have two possible strategies: a and b for player 2 and c and d for player 3. Since 
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Figure 13: Game with three players showing 
that rule RS^(NE) and RS^(SPE) are not rect¬ 
angular. Player 1 controls no state, player 2 
controls the square state and player 3 the round 
state. 



Figure 14: Game with two players showing that 
rule AG^ and AG^ are not rectangular. Player 1 
controls the round state and player 2 the square 
state. 


player 1 is always winning, cti is a solution for RS'^(NE, SPE). The profiles (a, c) and (b, d) are two (subgame 
perfect) Nash equilibria which make (j)i hold. However the profile (a, d) obtained by picking one strategy in 
each profile, is no longer a Nash equilibrium (and so not a subgame perfect equilibrium). Therefore RS'^(NE) 
and RS'^(SPE) are not rectangular. 


RS^(NE) and RS^(SPE) 


Gonsider the game represented in Figure 1131 


Player 1 has only one strategy 


and the other players have two possible strategies: a and b for player 2 and c and d for player 3. The profiles 
(a, c) and (5, d) are two (subgame perfect) Nash equilibria which make 4>i hold. However the profile (a, d) 
obtained by taking one strategy in each profile, is no longer winning for player 1. Therefore RS^(NE) and 
RS^(SPE) are not rectangular. 


Coop Once again, consider the game represented in Figure [TSl The profiles (a, c) and (6, d) make all 


players win, but the profile (a,d), is no longer winning for the player 1, so it is not a solution of Coop. 
Therefore Coop is not rectangular. 


Consider the game represented in Figure [HI The profiles (a,c) and (6,d) make the two 
players win. Since all possible outcome of the game satisfy the implications 4>\ => (f >2 and (/)2 => (/>i, both 
profiles are solution to AG^ and AG^ (note that the two concepts coincide here because there are only two 
players). However the profile (a, d) obtained by taking one strategy in each profile, is no longer winning for 
the player 1. Therefore AG^ and AG^ are not rectangular. 


AG'^ and AG^ 


□ 


C Complements on Algorithm for Assume-Admissible Synthesis 
(Section [6]) 

In this section, we recall some detailed results on values and admissible strategies, and give the details of 
the algorithm for assume-admissible synthesis. 

C.l Values of Histories and Admissibility 

For the proofs, we actually need a more refined notion of value, which coincides with the notion defined in 
the core of our paper for prefix-independent objectives. Let us give the formal definition from [5]. 

We fix a game G. Given a history h, and a set of strategies E' for player i, we write E'(/i) for the set of 
strategies of S' compatible with h, that is, the set of strategies ai such that h G OutG(cri)- We also write 
S'(/i)forn,epS'(h). 

Definition 1 (Value [5]). Let E' be a rectangular set of strategy profiles. The value of history h for player i 
with respect to E', written Vali(E',/i), is given by: 

• if every a-p G E'(/i) is losing for player i then Vali(E', h) = —1; 
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• if there is a strategy of Oi € such that for all strategy profiles a-i in the profile {ai,a-i) 

is winning for player i then h) = 1; 

• otherwise Vali(I]', h) = 0; 

We use the shorthand notation Vali(h) = Vali(S, h). Notice that the value only depends on the last state 
for prefix-independent objectives. We may thus write Vab(s) = Vali(/i) for s = last(/i); observe that this is 
the notation we use in the core of the paper. 

We noted in Lemma [5] that admissible strategies do not make the player’s value decrease. We prove here 
that conversely, any winning run on which player i does not decrease its own value is compatible with an 
admissible strategy of player i. 

Lemma 13. Let i be a player, and p a history. If p \= (fi and player i does not decrease its own value in any 
prefix of p, then there exists a strategy profile {ui, a-i) € Adm^ x S-i such that p is the outcome of (ui, a-i). 

Proof. We define the strategies ai and a-i to follow p when possible (if the current history is a prefix of 
p, then proceed to the following state of p), and if a deviation has occurred in p, that is there is k such 
that hk = Pk, hk+i 7 ^ Pk+i, then starting from h<k+i, ai follows a non-dominated strategies with respect to 
T,{hk+i) (Thanks to [5] and the fact that this set allows shifting [5] such a non-dominated strategies exists). 
The run p is obviously an outcome of this profile. We now have to show that the strategy ai that we define 
is admissible. According to [H Lem. 9], it is enough to show that for every history h outcome of ai, the value 
for player i with respect to {ai} x is greater or equal to that of E. 

Let h be a finite outcome of ai. We distinguish the case where h has deviated from p and the case where 
it has not. 

If a deviation has occurred, then ai follows a strategy non dominated with respect to E(h</c+i) where k 
is the last index where hk = Pk- By [2] Lem. 9], the value of {ai} x H-i{h) in h is greater or equal to that of 
E(h<fc+i). Since Yi-iih) C E_i(/i<fe+i), the value of h with respect to {ai} x T,-i{h) is greater or equal to 
that with respect E(/i). Note that by the definition of the value, the value of h with respect to a rectangular 
set E' is equal to that of h with respect to E'(h). Therefore the value of h with respect with {ai} x E is 
greater or equal to that with respect to E. 

If a deviation has not occurred then /i is a prefix of p. The value of h with respect to E is greater or 
equal to 0 since p is winning for (fi. Then: 

• if the value is 0, then as there is an outcome of ai after this history which is winning (the run p), the 
value of ai is at least 0; 

• if the value is 1, then we can show that from this state ai plays a winning strategy: if we stay along p, 
the run is winning, if we deviate, the run goes to a state of value 1 (player i has not decreased its own 
value and from a state of value 1 for player i, any action of the adversaries lead to a state of value 1), 
and therefore ai revert to a winning strategy. 

Therefore the property is satisfied by ai and it is admissible. 

□ 

We now prove Lemma [3] characterizing the outcomes of admissible strategies. This result follows from [4], 
but we adapt it to make the formulas <I>i appear explicitly. 

Lemma 3. For all games G, and players i, Outc fl <I>i = OutG(Admi, E_i), where ^i = G(Ai) A (GF(yi^i) ^ 
A(GF(I4,o) ^</>^VGF(i^0)• 

Proof. In [U Lemma 6], an automaton A} is defined such that A\ nOutG(E) = OutG(Admi, E_j). (Note that 
a more general construction Af was given in [4]; we only need the case n = 1 here.) 

We now analyze further the language of Af. The edges are those of G except for edges outside of Ei 
(these edges are noted T in [3]), so the set of runs in A\ corresponds to Outo H G{Ei). Now a run of Aj is 
accepted if, and only if one the following condition is satisfied, writing VR(p) for the sequence (Val(pi))igN: 
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• VR(p) e 0*(-l)‘^; 

• VR(p) G 0*1“^ and p \= 

• VR(p) G 0“^ and p \= (j)^ or p \= GF{Hi). 

Any run of Outc H G{Ei) reaching some state of value —1 is necessarily losing; thus all successors also 
have value —1. Similarly, because we removed edges where player i decreases its own value, once the runs 
reaches a state of value 1, it never gets out of these states. Therefore runs of Outc fl G{Ei) have one of the 
three forms: 0*(-l)‘^, O^T^ or 0“^. 

Let p be a run that is accepted by A\, it satisfies G{Ei) and: 

• if p ends in the states of value —1 then it does not visit Vi^i or Vi^ infinitely often and thus belongs to 

• if p ends in the states of value 1, then by the acceptance condition it satisfies and thus belongs to 

• otherwise it stays in the states of value 0, then by the acceptance condition, either it satisfies (pi or 
GF{Eli) and thus belongs to 

Now let p be a run that satisfies pi, it satisfies G{Ei) and therefore corresponds to a valid run of A}. 

• If p ends in the states of value —1 then condition VR(p) G 0*(—1)“^ is satisfied, thus p is accepted by 

Al 

• If p ends in the states of value 1, then by definition of it satisfies pi and condition VR(p) G 0*1“^ 
and p \= pi is satisfied, thus p is accepted by A}. 

• Otherwise it stays in the states of value 0, then by definition of <I>i, either pi or GF{Hi) holds for p, 
hence VR(p) G 0‘^ and p \= pi or p \= GF{Hi) is satisfied, thus p is accepted by A\. 

This shows that n Outc = A} H Outc and by [H Lemma 6], this equals Out(Admi, E_i). 

□ 


C.2 Algorithm 

We start by a characterization of games with AA-winning strategy profiles. An assume-admissible winning 
strategy ensures its objective against all admissible strategies of other players; we formalize this in the 
following lemma using values (see Definition [IJ . 

Lemma 14. For all game G, there is an AA-winning strategy profile if, and only if for all players i, 

Vab((Adrrij , Sjnit) — 1- 

Proof. By definition of the value we have for all player i, that: 

Vali((Admi(G))^gp,Sinit) = 1 Bcr* G Admj(G). Vcr_j G Adm_i(G). G, 0 -^,cr_i |= (1) 

I ^ I If cr-p is a witness for Assume-Admissible then for all players i, ai G Adm.i(G) is such that for all 
cr_i G Adm_i(G), G,tTi,tT_i ^ pi. Hence by ([T]), for all players i, Vab((Admi(G))igp, Sinit) = 1- 

I I Assume that for all players i, Vali((Admi(G))igp, Sinit) = 1. ByO we have that for all players i, there 
exist ai G Admi(G) such that \/a-i G Adm_i(G). G,ai,a-i \= pi. Let av be a strategy profile made of such 
strategies ai. We have that for all player i, ai G Admi(G) (condition 1 in the definition of Assume-Admissible) 
and moreover for all a'_^ G Adm_i(G), G,ai,a'_^ ^ pi (condition 2 in the definition of Assume-Admissible). 
Hence the strategy profile a-p constitutes a solution to Assume-Admissible. 

□ 
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The algorithm to synthesize AA-winning strategies uses procedures from [1], originally developed to 
compute the outcomes that survive the iterative elimination of dominated strategies. More precisely, the 
elimination procedure of [1] first computes the outcomes of admissible strategies; from this it deduces the 
strategies that are not dominated when all players are restricted to admissible strategies, and their possible 
outcomes; and this is repeated until the set of outcomes stabilizes. In the end, one obtains the set of the 
runs that are the outcomes of strategy profiles that have survived this iterative elimination. 

Here, we roughly consider the first iteration of the above procedure, and explicitly give algorithms to 
actually synthesize strategies that are winning against admissible strategies. 

We explain the proof of Lemma 0] which follows from [3] . 

Lemma 4. A player % strategy is AA-winning iff it is winning for objective 

Proof. It is shown in [H Prop. 5] that a strategy of player z is a strategy of S” which is winning from state s 
against all strategies of 11"^ if, and only if, it is winning for objective H”(s) (where E” is the set of strategy 
that remain after n step of elimination). The results immediately follows of the case n = 1. 

□ 

We will now establish the correctness of the game G'. Let us first formalize the correspondence between 
G and G'. We define relation ^ C S x S': for all s € S x {_L,0, T}, s ~ {s,x). We extend this to runs by 
p ~ p' iff for all z e N, Pi ~ p'. In fact the relation defines a bijection: 

Lemma 15. For any p G Outc there is a unique p' G Outc' such that p p'. 

Proof. Assume towards a contradiction that we have p' and p" such that p = 7r(p') = 7r(p"). Let i be 
the last state such that they coincide: p' = p" and p'_|_^ ^ p'l+i- Since 7r(p') = 7r(p") we have that they 
differ only by the second component. We can assume without loss of generality that there is are actions a 
and b such that {pi,a) € Ej (where player j controls pi), {pi,b) ^ Ej and S{pi,a) = pi+i = 5{pi,b). This 
means that there is are actions a and b such that (s, a) € Ej (where player j controls pi), (s, 6) ^ Ej and 
S{pi,a) = pi+i = d{pi,b). We have S{pi,b) = S{pi,a), then by definition of Ej and because {pi,a) G Ej, 
Valj((I(s,a)) = Valj(s) therefore Valj(d(s, 6)) = Valj(s) and by definition of Ej, {pi,b) belongs to Ej which 
contradicts our assumptions and ends the proof. 

□ 

We write tt the bijection which, to p' G Outc'. associates p G Outc with p ~ p'. We extend tt as a mapping 
from strategies of G' to strategies of G by 7r(cr')(/i) = (T'(7r“^(/i)). We can notice that for all strategies cr', 
7r(OutG'(cr')) = OutG(7r(cr')). 

Lemma 16. Let G be a game, and i a player. Player i has a winning strategy for Lli in G if, and only if, 
he has a winning strategy for in G'. Moreover if cr' is winning for H' in G' then 7r(CT') is winning for Lli 

in G. 

Proof. We will first rewrite in a form that is closer to that of H'. The objective is defined by 
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OutG(Admi) n (OutG(Adm_i) ^ (j)i). Observe that OutG(Adm_i) = nj^iOutG(Admj) by definition. 


= OutG(Admi) n I Pi OutG(Admj) 
^OutG n p j 


Oi = n OutG n 




using Lem. [3] 


Oi = $, n OutG n p 

\j¥=i 



Oi = OutG A G{E,) AM, A 

Oi = OutG A G{E,) AM, A 


iAi 

0". 


V y ^G(£j) 

jAi 


v\/f{^e,) 

jAi 


I ^ I Let (Ti be a winning strategy for in G. We consider the strategy ct- defined by <j[{h') = ai{'K{h')) 
and will show that it is winning for O'. Let p' be an outcome of a[. We have that T^{p') is an outcome of ct^. 
Since Ui is winning for 7r(p') belongs to O^. 


• If 7r(p') 1= Mi A G{Ei) A \JF{-^Ej), then by construction of 5' the play p' reaches a state of S x {T} 
and, from there, only states of S x {T} are visited. The condition GF(S x {T}) A Mi is met by p' and 
therefore p' is winning for O'. 

• Otherwise 7r(p') ^ Mi A G(i?i) A (A^y^i ^j) ^ A- By construction of 5' the play p' stays in S x {0}. 
The condition GF(S x {0}) A Mi A (A^y^i ^j) ^ A is met by p' and therefore p' e O'. 

This shows that the strategy cr' is winning for O' in G'. 

I <t= I Let cr' be a winning strategy for O' in G', we show that 7r(cr') is winning for Oi in G. Let p be an 
outcome of 7r(cr'). We have that 7r“^(p) is an outcome of cr'. Since cr' is winning for O', 7r“^(p) belongs to 
O'. We have that 7r“^(p) A ^>^(5 x {0, T}) and by construction of S' this ensures that all edges that are 
taken belong to Ei and thus 7r“^(p) satisfies the condition G{Ei). 

• If 7r“^(p) \= GF{S X {T}) AMi then by construction of <5', an edge outside of Ej for some j ^ i is taken. 
This ensures condition F(^Ej) and therefore p belongs to O^. 

• otherwise 'K~^{p) \= (Aj/i 4 ‘i) and therefore p satisfies the condition G(£li)AMiA^p^j Mj a) 

and hence belongs to O^. 

This shows that the strategy 7r(cr') is winning for 11^ in G. 

□ 


This characterization immediately yields the decidability of the problem in PSPAGE. Moreover, PSPAGE- 
hardness follows from that of Muller games. We also provide a polynomial-time algorithms for the particular 
case of Biichi conditions. The following theorem proves the first statement of Theorem IH the second 
statement is proved afterwards (Theorem [7]) . 

Theorem 6. AA-synthesis in multiplayer Muller games is PSPACF.-complete and P-complete for Biichi 
objectives. 
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Proof. We use the result of [H Prop. 8] in the special case where n = 1, in which case the proposition 
says that for multi-player games with Muller objectives, checking whether Vali((Admi(G))igp, Sinit) = 1 is 
in PSPACE. By Lem. [TH with one call to this algorithm for each player, we are able to decide if AA has a 
solution and this proves PSPACE membership. For hardness we encode 2-players zero-sum Muller games in 
our setting: the first player keep the same objective and the second one is always winning. In this setting 
no strategy of the second player is dominated so finding an AA winning strategy for the first player is the 
same as finding a winning strategy in the original game. 

Biichi Case In the case of Biichi objectives, let us write = GF{Bi) and consider the objective Mi = 
GF(Ifi_i) => GF{Bi)AGF{Vi^o) (GF{Bi)\/GF(Hi)). In G, a run that verifies G{Ei) will either visit only after 
some point, or only after some point, or only 1^ o (see the proof of Lemma[3]for details). This means that 
Mi coincide with GF{{Vi^iABi)\/{Vi^oABi)V{VifiAHi)\/Vi^-i) on the language OutG(S)nG(iili). This is a Biichi 
condition. Note that in G', runs that take an edge outside of Ei end in the S x {T} component and fl' disallows 
such runs. Therefore, in the expression of fl', we can also replace M^ by a Biichi condition that we write 
GF{Bf^). By abuse of notation we will also write Bi for the states Bi x {T, 0, T} of the game G'. Consider now 
the objective fl', it is given by (GF(S x {0}) AGF(i?)'^) A GF{B^) GF{Bi x {T, 0, T}))) V (GF(S x {T}) A 
GF{B^)). Since in this game, states of SxT and SxT are absorbing (no play can get out of those components) 
we write an equivalent objective which is: {GF{B^ x {0}) A {l\j^iGF{B^) ^ GF{Bi))) V {GF{B^ x {T}). 
We define a (small) deterministic parity automaton A that recognizes this language. Its state space is 
({s,t,u,u} X {{j \ j € P \ {f}} U {T})), the transition relation is a product of transitions for the two 

SfxIO.T} -.Bf Bf\Bi , B. , . -s,x{0} . . BjX{0} 

components: s ->■ u, u ->■ t, t, u - t, u —>■ v, v ^ s, and j ->■ j, j - j where 

j' is j -F 1 if j -F 1 S P \ {f}, j -F 2 if j -F 1 = f and j + 2 €V, T otherwise, T —>• jo where jo is the smallest 
element oiV\ {z}. The coloring is defined by a function x where x(u, *) = 4 (where * is any possible second 
component), x({s,t,M},T) = 3), x(w,'P \ {z}) = 2, and for all other states s, x(s) = 1. A word is accepted 
by A when the maximal color appearing infinitely often is even. 

We show that a play of G' satisfies O' if, and only if, it is a word accepted by A. 

Let p be a play of G' which satisfies O', either it ends in the S x T component or the S x 0 component: 

• If p ends in the T component then the state of color 3 will not be visited infinitely often (we need 
to be in T states to progress on this component of the automaton). As p visits infinitely often B^^, 
the corresponding run in A will visit infinitely often zz, and therefore the maximal color that appears 
infinitely often is at least 2; and since it is not 3, it has to be even. 

• Otherwise p ends in the 0 component. Since p satisfies O', it visits Bj^ infinitely often and either there 
is a B^ for j ^ i that is not visited infinitely often, or p visits infinitely often Bi. 

— If there is a B^ for j ^ i that is not visited infinitely often, then the second component of A will 
get stuck at some point and its state T will not be visited infinitely often the state of color 3. As 
p visits infinitely often B^, the corresponding run in A will visit infinitely often zz, and therefore 
the maximal color that appears infinitely often is at least 2; and since it is not 3, it has to be 
even. 

— Otherwise p visits infinitely often Bi. Since we also visit Bj^ infinitely often, the run of A corre¬ 
sponding to p will reach infinitely often a state (z;, *) and therefore the maximal color occurring 
infinitely often is 4. 

This proves that the word is accepted by A. 

Now let p be a play of G' such that the corresponding word is accepted by A. If it is accepted then either 
the color 4 is seen infinitely often or the color 2 is and the color 3 is not: 

• If the color 4 is visited infinitely often then this means t is reached infinitely often, and because of the 
structure of A, u also is, which means both B^ x {0} and Bi occur infinitely often. This implies that 
the run p belongs to O'. 


26 







• Otherwise the color 2 is visited infinitely often and 3 is not. The states {*, T) are therefore not visited 
infinitely often (otherwise the maximal color would be 3 or 4. We deduce from that and the structure 
of A that some for j ^ i is not visited infinitely often. This means /\j^iGF{B^) is not true 
for p. Since the color 2 is seen infinitely often, this means u, * is seen infinitely often and therefore 
Bi X {0, T}. This ensures p belongs to fl'. 

This proves that a play of G' satisfy fl' if, and only if, it is a word accepted by A. 

Then solving the game G' with objective 17' is the same as solving it with objective given by A. This 
can be done by solving the parity game obtained by the product of G' with the automaton A. The obtained 
game is of polynomial size and the number of priority is 4, such games can be solved in polynomial time (see 
for instance [221128) 1 and therefore we can decide our problem in polynomial time. 

□ 


We are now interested in computing an AA-winning strategy profile. Thanks to Lemma ITfil we obtain an 
algorithm to compute AA-winning strategies by looking for winning strategies in G' and projecting them: 

Theorem 7. Given a game G with Muller objectives, if AA has a solution, then an AA-winning strategy 
profile ean be computed in exponential time. 

Proof. If AA has a solution, then by Lemma [1^1 there is a winning strategy for 17' in G'. This Muller game 
has polynomial size, hence we can compute a winning strategy tr' in exponential time (for instance in |26j the 
authors show that we can compute such a winning strategy via a safety game of size |S|!^). By Lemma fTHl the 
projection 7 r((T') is an AA-winning strategy. Doing this for each player we obtain a strategy profile solution 


D Complements on Abstraction (Section [7]) 

D.l Abstract Games 

The pair of abstraction and concretization functions (a, 7 ) actually defines a Galois connection'. 

Lemma 17. The pair ( 0 , 7 ) is a Galois connection, that is, for all S CS and T C S^, we have that a{S) C T 
if, and only if, S C ^(T). 

Proof. I => I Let s £ S. Since 7 defines a partition of S, there exists t £ such that s G 7 (t). By definition 
of a, a(s) = t. Assuming a{S) C T, we have that t £T. As s G 7 ( 1 ), we have s £ "i{T). 

I <= I If s^ £ a{S), then there is s G S' such that s^ = q;(s). Assuming S C 7(T), there is t £ T such that 
s = 7(7). By definition of a, we have that q:(s) = t. Therefore s® G T. 

□ 

We prove the soundness of the abstract arenas AP we defined, by showing that if coalition C achieves 
an objective in G, then it also achieves the objective in 

Lemma 5. If the coalition C has a winning strategy for objective (fk in G from s then it has a winning 
strategy for (Ak in A^ from a{s). 

Proof. Assume ac is a winning profile of coalition C, for objective fik in G. We define by induction a winning 
strategy tr^ in . We assume that Uq has been defined in a manner such that for each finite outcome 

h^ of shorter than some bound m, there is some h £ 7 (^ 1 ^) such that h is a, finite outcome of ac- The 
idea is then to define to resolve the determinism in a way which simulates the behavior from h. 

• If 'Si G Uie-p Si X Acti, then cr^{h^ ■ (last(/i^), a)) = 7 ( 7 ) where t = 6{\ast{h),a). 

• If G Uiec ■ (last(/i^), a) • s^) = uc{h ■ 6{\ast{h), a)). 
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With this definition, our induction hypothesis will be respected for histories containing one more step, 
and therefore this holds for all histories. Let now p® be an outcome cr^. By the way we defined this strategy 
there is a run p outcome of (Jq such that p G 7 (p^). As (Jq is winning, p satisfies the Muller condition (j)k and 

since 7 is compatible with players’ objectives, p® satisfies 0^. Which show that C has a winning strategy in 
Q3,k,C 

□ 


D.2 Abstract Assume-Admissible Synthesis 

Value-Preserving Strategies We provide the proofs of the lemmas stated in the core of the paper. 
Lemma 6. For all players k and x G {—1,0,1}, C Vk^x C 'y{Vk,x)- 


Proof. 


Vk. 


Vk,-i 


This is a direct consequence of Lemma [SJ 


If s G Vk^-i then the coalition V has no winning strategy in G. By determinacy, the empty 

coalition has a strategy to ensure -<(j)k- Therefore by Lemma [SI the coalition 0 has a strategy in from 
a(s) that ensures -^(fk- Therefore s G ^{Vk-i)- 

Recall that Vk,o = VVin-p\{fe}(A,-i^^) n Win-p(A, Let s be a state in Vk,o. By LemmajSl a{s) 


Vk.i 


belongs to both sides of the intersection, thus a{s) G Vk,o. Thus 14, 0 C jiVk.o)- 

If G V_f. I then the coalition V \ {k} has no strategy in for -'(!)%■ Therefore by LemmajSl 


V 


■k.l 


it has no strategy in A from any state of 7 ( 5 ^) to do so. Therefore k has a winning strategy in A from 7 (s^), 
an d 7 ( 5 ^) G Vfe,i. 

If G V_k _i, then the coalition V has no winning strategy in AJ^ for objective (j}%. Therefore 


V 


.k,-l 


by LemmajSl it has no winning strategy in A from 7 ( 5 ^) neither for the objective (fk- This means that 
7 ( 5 ^) G Vk-i- 

Note that by definition of the lyX. operator, V_k 0 Q F. Thus, let us just show that 7 (A) C 14,q. 




Recall that 14 ,0 = \N\n-p\[k}{^,~'(l^k) r\W\n'p{A,(l)k)- Let s G 7(14fco)- Then player k has no strategy 
in A^^^ for (/)|, hence, by LemmajSl it cannot win A neither for (fk from 7(3). This shows that 7(5) C 
Win-p\{fc} (A, Furthermore, the coalition 0 has no strategy in AJ^ for thus it does not have one 
neither in A for ^( j>k from 7(5). In other terms, 7(5) C Win-p(A,0fc). 


□ 


We show that when playing according to player k ensures staying in V. This is proven in the 
following. Let us write 7 (f) = {(s, a) | (a(s), a) G £} for £ G (Aj., Ek}- 

Lemma 18. For all games G, and players k, 1. 7 (Aj, fl ( V x Act)) C Ek Q 'y{Ek). 2. For all s^ G S^, there 

exist a, a' G Actk such that (s^,a) G Aj, and (s^,a') € Ek- 3. For all (s^,a) G with s^ G F, we have 

post^{s^, a) C y_. 

Proof. The inclusion Ek Q l{Ek) follows from the definition of Ek, and by LemmajSl It also follows that 
for all s G S^, there is {s,a') G Ek, since this is always the case for Ek. 

Let (s^,a) be an edge in A;. (7 (F x Act). Let s be a state in 7 ( 3 ®). We have that s G 'y{V_k,x) some 

X G {—1,0,1} and by Lemma [ 6 ] s G Vk,x- By definition of Ek, for all E such that A{s^,a,E), E G F^. ^ 
with I > X and €V_k^. By Lemma jH we have that the value of all states in ')[E) are at least as great 
as any state in 7 ( 3 ^). By definition of A, a{6{s,a)) C {E \ A{s^,a,E)}. Therefore a(d( 3 , a)) G Ui>xY_ki, 
which means S(s,a) G Ui>x 7 (Ui>xVk i) C Ui^xVk,! using Lemma jH By definition of Ek this implies that 
(3,0) G Ek. 

It remains to prove that for all 3 ^ G S|, there is ( 3 ^, a) G E_k, and that if 3 ^ G F, then for all ( 3 ^, a) G E_k, 
A{s^,a,E) implies E G F. 

If 3 ^ G \ F, then ( 3 ^, a) G Aj. for all a G Actfe by definition. Let us now assume 3 ^ G F- 
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• If S Xfe.-i; then By definition oiV_f. _i, we have that for all actions a, and all states if A^(s®, a, t®) 
then € Y.k- 1 - Thus (s^, a) G and G y_k-i for any such so G V_. 

• If s G then there exists a such that (s^,a,t^) G implies G So (s^,a) G and 

G Moreover this holds for all a with (s^,a) G E_j., since for such a, G A® implies 

G 1 by definition of 

• If s G ]£fe,o> then by the greatest fixpoint defining V_i. q, there exists a G Actfc such that for all with 

A(s^,a,<^), G y_f. Q. Conversely, for all (s^,a) G Ej., a ensures staying inside q U Thus for 

any such a, {s^,a) G and any E, A{s^,a,E) means E G y_k,o- 

□ 

Recall that does not constrain the actions outside the set R; thus strategies in Strata can actually 
choose dominated actions outside V_. To prove that Stratfe(^j,) is an under-approximation of Stratfe(iffe) when 
started in V, we need to formalize the fact that admissible strategies may choose arbitrary actions at states 
that are not reachable by any outcome. Intuitively, such strategies cannot be dominated since the dominated 
behavior is never observed. 

For any strategy ct, let Reach (G, ct) denote the set of states reachable from sinit by runs compatible with a. 
We show that if one arbitrarily modifies an admissible strategy outside the set Reach(G, cr), the resulting 
strategy is still admissible. 

Lemma 19. Let a be a strategy in Admi(G) and a' a strategy in Ei(G). If for all histories h such that 
last(/i) G Reach(G,CT), ^ cr(h) = <j'{h), then a' G Admi(G). 

Proof. For all profiles a-k G E_fe(G), we have OutG(cr_fe,CT) = OutG(cr_fe, cr') so if a' is dominated, then a 
would also be dominated, which is a contradiction. 

□ 

Lemma 7. For all games G, and players k, Stratk{Ek) C 7 (Stratfc(iffe)), and if Sinit G 7 (R), then 0 ^ 
7(Stratfe(,Ej,)) C Stratfe(£;fe). 

Proof. Since Ek C ^{Ek) by Lemma [T51 we have Stratfc(£'fe) C 7 (Stratfe( 7 (iffe))). 

Assume Smit G 7 (R). The fact that Stratfc(A^), thus also 7 (Stratfe(,E^)) are non-empty follows from 
Lemma ITSl too, since for any state there is a G Actfe with (s^,a) G 

We prove that Reach(M^^f*f, cr) C V_ for all a G Stratfe(^;.). We already know, by Lemma fT^ that for 
all s^ G R, if (s^,a) G then all successors P with A{s^,a,P) satisfies P G V). We are going to show that 
for all s^ G V) n with j ^ k, for all a G Actj, A^(s^, a, P) implies P G R. 

Consider s^ G F. If s^ G then for all a G Act, A^{s^,a,P) implies that P G since V \ {k} 

resolves non-determinism. The situation is similar if s^ G V.k-i', for all a G Actj, A^(s^,a,P) implies 
P G y_f. If s® G V_f. Q, then, by the definition of the outer fixpoint, for all a G Actj, A^{s^,a,P) implies 
that P G V_. 

Thus Reach(A^''^^^, ct) C F for all a G Strat/c(A;.). It then follows that Reach(G, 7 ( 0 -)) C 7 (F). So, by 
Lemma and by the fact that C Ek, all strategies in 7 (Stratfe(,Ej,)) are value preserving, which is 

to say, belong to Strata (iffc). 

□ 


Help States 

Lemma 20. For all players k, 'y(H,A C Hk C "f[Hk). 

Proof. Let s^ G H_k^ let a,b € Act two witnessing actions. For all s G 7(5^), we have S{s,a) G 
7(post^(s^, a)) C Vkfi U Ffcp and S{s, b) G 7(post^(s^, a)) C Vk^ U F^p. Moreover a{6{s, a)) G post^(s^, a), 
a{S{s,b)) G post^(s^, 6 ), and post^(s^,a) fl post^(s^, 6 ) = 0, therefore a{S{s,a)) ^ a{S{s,b)) and thus 
S{s,a) ^ S{s,b). Hence s G Hk. 
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Now, consider any s € Hk] and let a,b G Act be such that 6{s, a), S{s, h) G 14_o and (5(s, a) ^ S(s, b). 

If we write = a(^(s, a)) and = a((5(s, b)), then G Vk,o U Vk,i, and A(s^, a, t^), and A(s^, 6 , m®); 

thus a(s) G Hk- It follows that Hk Q □ 

Synthesizing AA-winning Strategies 
Lemma 21. We have 7 (M^) C C 

Proof. We have 'y{<t>^) = 4>k by assumption on 7 . Thus, by Lemma [51 

'f{{GF{Vk,i) => C GF(I4,i) ^ (fk) C 7((GF(I£fe4) (fl)). 

Similarly, by Lemma[20l we get 7 (GF(yfc,o) ^ V GF(iJfc))) C GF(I4,o ^ {(l)k^GF{Hk)) C 7 (GF(I/j, q) ^ {(fl V GF{Hk))) ■ 
It follows that 'y{MLk) ^ ^k — l{M'k). 

□ 


The following lemma proves Theorem [5l 

Lemma 22. Let k G P he a player and cfk a strategy of player k. G y_, and ak is winning for objective 

n' i. in A'f., then 7 ( 0 '^) is winning for flj, in G^. 

Proof. Let us rewrite 


nh = M', A 


GF(S" X {0}) A (/\ M'j 



VGF(S" X {T}) 


Let Ok be a winning strategy in for We will show that G',j{ak) |= 

Consider any run p of G'^, compatible with j{crk). By definition of 'y{o'k), ol{p) is a run of A'^. compatible 
with (Tfc. Since Ok is a winning strategy, a[p) G M ' and by Lemma ITU o G M^. 

We now show that p G GF(S x {0, T}). By assumption, we have A!i^,<Jk |= GF(S^ x {0, T}), which means 
that for all histories of A'^. compatible with ak, {\ast{h^),a{h^)) G E_f. (otherwise the transition relation of 
A'^ would lead to a T state). Moreover, since G V_, it follows from Lemma[7]that {\ast{h),^{a){h)) G Ek 
for all histories h compatible with 7 (crfc). Thus no state {*, T) is reachable under ^{a) in G(.. 

Because of the structure of GJ. this means that p either visits states of S x {0} or states of S x {T} 
infinitely often: 

• If p £ GF(S X {0}), then a{p) G GF(S® x {0}); so a{p) G Aj/fc ^ follows, by Lemmaand 
the compatibility of the abstraction with players’ objectives, that p G Aj//c ^ 4’k. Thus p G fl).. 

• Otherwise p G GF(S x {T}), so p G 

Thus any outcome p of '){ak) belongs to which shows it is winning. □ 


E Algorithm for Assume-Guarantee Synthesis 

The assume-guarantee-A rule was studied in for particular games with three players. However, the 
given proofs are based on secure eguilibria which do not actually capture assume-guarantee synthesis, so 
the correctness of the algorithm is not clear. Here, we give an alternative algorithm for deciding assume- 
guarantee-A for multiplayer games, and prove its correctness. 

For any game G, and state s, we denote by Gs the game obtained making s the initial state. Assuming 
that for each player i has an objective (pi which is prefix independent, let us define Wi = {s G S | dcr^. G^, ( 7 ^ |= 
Ajg-p\{i} 4>j A}- 

The following lemma gives a decidable characterization of assume-guarantee synthesis: 
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Lemma 23. Let (pi be a prefix-independent objective. Rule AG^ has a solution if, and only if, there is a 
run p which visits only states Wi and such that p |= Aigp 'hi¬ 

proof. I => I Let cr-p be a solution of AG^. Let p be its outcome. We have that p \= Ai^-p 4‘i by hypothesis 
of AG^. Let i be a player, we show that p only visits states of Wi. This is because ai is winning for 
A_,ep\{i} 4>j 4>i- Foi' 3-11 k, p<k is a finite outcome of Oi, and the strategy played by Oi after this history is 

winning for Aj^v\{i} ^ which means that pk belongs to Wi. Hence p satisfies the desired conditions. 

I I If there is such a run p, we define the strategy profile ap to follow this run if no deviation has 
occurred and otherwise each player i plays a strategy which is winning for Aj^v\{i} '^3 ^ possible. We 
show that such a strategy profile satisfies the assumption of assume-guarantee. Obviously <jp |= Ai^vA- 
Let p' be an outcome of Ui and k the first index such that 7 ^ pk. The state p'k-i = Pk-i is not controlled 
by player i, because ai follows p. As pk-i is in Wi and not controlled by player i, this means that p'^, £ Wi. 
Therefore ai plays a winning strategy from pj, for the objective Aj^'p\{i} thus p' satisfies this 

objective. Hence ap is a solution of AG^. 

□ 

We deduce a polynomial-space algorithm for the AG^ rule with Muller objectives: 

Theorem 8. For multi-player games with Muller objectives, deciding whether AG^ has a solution is PSPACE- 
complete. 

Proof. The algorithms proceed by computing the set Wi for each player i with an algorithm that computes 
winning regions and then checks whether there is an infinite run in the intersection fjiep bPi which satisfies 
Ajgp (pi- This algorithm is correct thanks to Lemma [531 

This is in PSPACE because the objective Aj£P\{i} ^3 ^ A can be expressed by a Muller condition 
encoded by a circuit [20] of polynomial size. We can decide in polynomial space if a given state is winning 
for a Muller condition given by a circuit. Thus, the set C\i^p Wi can be computed in polynomial space; let 
us denote by G' the game restricted to this set. The algorithm then consists in finding a run in G' satisfying 
Ai^P A) that is, finding a run satisfying a Muller condition, which can be done in polynomial space. 

□ 
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